AITopics | radiological image

Collaborating Authors

radiological image

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Regional Attention-Enhanced Swin Transformer for Clinically Relevant Medical Image Captioning

Naz, Zubia, Asghar, Farhan, Hussain, Muhammad Ishfaq, Hadadi, Yahya, Rafique, Muhammad Aasim, Choi, Wookjin, Jeon, Moongu

arXiv.org Artificial IntelligenceNov-14-2025

Automated medical image captioning translates complex radiological images into diagnostic narratives that can support reporting workflows. We present a Swin-BART encoder-decoder system with a lightweight regional attention module that amplifies diagnostically salient regions before cross-attention. Trained and evaluated on ROCO, our model achieves state-of-the-art semantic fidelity while remaining compact and interpretable. We report results as mean$\pm$std over three seeds and include $95\%$ confidence intervals. Compared with baselines, our approach improves ROUGE (proposed 0.603, ResNet-CNN 0.356, BLIP2-OPT 0.255) and BERTScore (proposed 0.807, BLIP2-OPT 0.645, ResNet-CNN 0.623), with competitive BLEU, CIDEr, and METEOR. We further provide ablations (regional attention on/off and token-count sweep), per-modality analysis (CT/MRI/X-ray), paired significance tests, and qualitative heatmaps that visualize the regions driving each description. Decoding uses beam search (beam size $=4$), length penalty $=1.1$, $no\_repeat\_ngram\_size$ $=3$, and max length $=128$. The proposed design yields accurate, clinically phrased captions and transparent regional attributions, supporting safe research use with a human in the loop.

caption, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2511.09893

Country:

Asia > South Korea (0.15)
Asia > Middle East > Saudi Arabia (0.14)

Genre: Research Report > Experimental Study (0.48)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.47)

Add feedback

RadSAM: Segmenting 3D radiological images with a 2D promptable model

Khlaut, Julien, Ferreres, Elodie, Tordjman, Daniel, Philippe, Hélène, Boeken, Tom, Manceron, Pierre, Dancette, Corentin

arXiv.org Artificial IntelligenceApr-30-2025

Medical image segmentation is a crucial and time-consuming task in clinical care, where mask precision is extremely important. The Segment Anything Model (SAM) offers a promising approach, as it provides an interactive interface based on visual prompting and edition to refine an initial segmentation. This model has strong generalization capabilities, does not rely on predefined classes, and adapts to diverse objects; however, it is pre-trained on natural images and lacks the ability to process medical data effectively. In addition, this model is built for 2D images, whereas a whole medical domain is based on 3D images, such as CT and MRI. Recent adaptations of SAM for medical imaging are based on 2D models, thus requiring one prompt per slice to segment 3D objects, making the segmentation process tedious. They also lack important features such as editing. To bridge this gap, we propose RadSAM, a novel method for segmenting 3D objects with a 2D model from a single prompt. In practice, we train a 2D model using noisy masks as initial prompts, in addition to bounding boxes and points. We then use this novel prompt type with an iterative inference pipeline to reconstruct the 3D mask slice-by-slice. We introduce a benchmark to evaluate the model's ability to segment 3D objects in CT images from a single prompt and evaluate the models' out-of-domain transfer and edition capabilities. We demonstrate the effectiveness of our approach against state-of-the-art models on this benchmark using the AMOS abdominal organ segmentation dataset.

artificial intelligence, machine learning, segmentation, (15 more...)

arXiv.org Artificial Intelligence

2504.20837

Country: Europe (0.46)

Genre: Research Report > Promising Solution (1.00)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-training

Lu, Zhixiu, Li, Hailong, He, Lili

arXiv.org Artificial IntelligenceMar-14-2024

The integration of artificial intelligence (AI) with radiology has marked a transformative era in medical diagnostics. Vision foundation models have been adopted to enhance radiologic imaging analysis. However, the distinct complexities of radiological imaging, including the interpretation of 2D and 3D radiological data, pose unique challenges that existing models, trained on general non-medical images, fail to address adequately. To bridge this gap and capitalize on the diagnostic precision required in medical imaging, we introduce RadCLIP: a pioneering cross-modal foundational model that harnesses Contrastive Language-Image Pre-training (CLIP) to refine radiologic image analysis. RadCLIP incorporates a novel 3D slice pooling mechanism tailored for volumetric image analysis and is trained using a comprehensive and diverse dataset of radiologic image-text pairs. Our evaluations demonstrate that RadCLIP effectively aligns radiological images with their corresponding textual annotations, and in the meantime, offers a robust vision backbone for radiologic imagery with significant promise.

dataset, radclip, representation, (14 more...)

arXiv.org Artificial Intelligence

2403.09948

Country: Europe > Switzerland (0.04)

Genre: Research Report (0.90)

Industry:

Health & Medicine > Nuclear Medicine (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

A Survey of the Impact of Self-Supervised Pretraining for Diagnostic Tasks with Radiological Images

VanBerlo, Blake, Hoey, Jesse, Wong, Alexander

arXiv.org Artificial IntelligenceSep-5-2023

Self-supervised pretraining has been observed to be effective at improving feature representations for transfer learning, leveraging large amounts of unlabelled data. This review summarizes recent research into its usage in X-ray, computed tomography, magnetic resonance, and ultrasound imaging, concentrating on studies that compare self-supervised pretraining to fully supervised learning for diagnostic tasks such as classification and segmentation. The most pertinent finding is that self-supervised pretraining generally improves downstream task performance compared to full supervision, most prominently when unlabelled examples greatly outnumber labelled examples. Based on the aggregate evidence, recommendations are provided for practitioners considering using self-supervised learning. Motivated by limitations identified in current research, directions and practices for future study are suggested, such as integrating clinical knowledge with theoretically justified self-supervised learning methods, evaluating on public datasets, growing the modest body of evidence for ultrasound, and characterizing the impact of self-supervised pretraining on generalization.

diagnostic task, radiological image, self-supervised pretraining

arXiv.org Artificial Intelligence

2309.02555

Genre:

Overview (1.00)
Research Report (0.69)

Industry:

Health & Medicine > Nuclear Medicine (0.40)
Health & Medicine > Diagnostic Medicine > Imaging (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.93)

Add feedback

What Does DALL-E 2 Know About Radiology?

Adams, Lisa C., Busch, Felix, Truhn, Daniel, Makowski, Marcus R., Aerts, Hugo JWL., Bressem, Keno K.

arXiv.org Artificial IntelligenceSep-27-2022

Generative models such as DALL-E 2 could represent a promising future tool for image generation, augmentation, and manipulation for artificial intelligence research in radiology provided that these models have sufficient medical domain knowledge. Here we show that DALL-E 2 has learned relevant representations of X-ray images with promising capabilities in terms of zero-shot text-toimage generation of new images, continuation of an image beyond its original boundaries, or removal of elements, while pathology generation or CT, MRI, and ultrasound images are still limited. The use of generative models for augmenting and generating radiological data thus seems feasible, even if further fine-tuning and adaptation of these models to the respective domain is required beforehand. DALL-E 2 is a novel deep learning model for text-to-image generation, first introduced by OpenAI in April 2022 [Ramesh et al., 2022]. The model has recently gained widespread public interest due to its ability to create photorealistic images solely from short written inputs [Kather et al., 2022][Conwell and Ullman, 2022][Marcus et al., 2022].

artificial intelligence, dall-e 2, machine learning, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.2196/43110

2209.13696

Country:

Europe > Germany > Berlin (0.16)
North America > United States > Massachusetts > Suffolk County > Boston (0.05)
Europe > Netherlands > Limburg > Maastricht (0.05)
(4 more...)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Nuclear Medicine (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Applications of Manifolds in Machine Learning and Deep Learning(Artificial Intelligence +…

#artificialintelligenceJul-28-2022, 06:03:36 GMT

Abstract: Admittedly, Graph Convolution Network (GCN) has achieved excellent results on graph datasets such as social networks, citation networks, etc. However, softmax used as the decision layer in these frameworks is generally optimized with thousands of iterations via gradient descent. Furthermore, due to ignoring the inner distribution of the graph nodes, the decision layer might lead to an unsatisfactory performance in semi-supervised learning with less label support. To address the referred issues, we propose a novel graph deep model with a non-gradient decision layer for graph mining. Firstly, manifold learning is unified with label local-structure preservation to capture the topological information of the nodes.

machine learning and deep learning, manifold, radiological image, (10 more...)

#artificialintelligence

Industry: Health & Medicine (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.43)

Add feedback

The Intrinsic Manifolds of Radiological Images and their Role in Deep Learning

Konz, Nicholas, Gu, Hanxue, Dong, Haoyu, Mazurowski, Maciej A.

arXiv.org Artificial IntelligenceJul-6-2022

The manifold hypothesis is a core mechanism behind the success of deep learning, so understanding the intrinsic manifold structure of image data is central to studying how neural networks learn from the data. Intrinsic dataset manifolds and their relationship to learning difficulty have recently begun to be studied for the common domain of natural images, but little such research has been attempted for radiological images. We address this here. First, we compare the intrinsic manifold dimensionality of radiological and natural images. We also investigate the relationship between intrinsic dimensionality and generalization ability over a wide range of datasets. Our analysis shows that natural image datasets generally have a higher number of intrinsic dimensions than radiological images. However, the relationship between generalization ability and intrinsic dimensionality is much stronger for medical images, which could be explained as radiological images having intrinsic features that are more difficult to learn. These results give a more principled underpinning for the intuition that radiological images can be more challenging to apply deep learning to than natural image datasets common to machine learning research. We believe rather than directly applying models developed for natural images to the radiological imaging domain, more care should be taken to developing architectures and algorithms that are more tailored to the specific characteristics of this domain. The research shown in our paper, demonstrating these characteristics and the differences from natural images, is an important first step in this direction.

artificial intelligence, deep learning, machine learning, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-3-031-16452-1_65

2207.02797

Country:

North America > United States (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Belgium > Flanders (0.04)

Genre: Research Report > New Finding (0.47)

Industry:

Health & Medicine > Nuclear Medicine (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Multimodal AI in Healthcare: Closing the Gaps

#artificialintelligenceJun-20-2021, 05:20:37 GMT

Healthcare professionals, in their daily routine, make use of multiple sources of data. To arrive to a diagnosis and decide on patient management, they rely on a combination of several types and sources of data: imaging (e.g., Radiology, Pathology, Ophthalmology), time series (e.g., electrocardiograms -- ECG), structured clinical data (e.g., vital signs, lab results) and non-structured data (e.g., clinical notes). Considering the level of expertise required to understand in depth one single data type, it is close to impossible for a single healthcare professional to master all areas. A radiologist has specialized training to read radiological images, but doesn't know as much about internal medicine or surgery. A cardiologist has a deep understanding of ECGs, but normally does not know how to evaluate a pathology slide.

data type, healthcare, multimodal ai, (10 more...)

#artificialintelligence

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (0.93)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.56)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Applied AI (0.66)

Add feedback

Current use cases for machine learning in healthcare

#artificialintelligenceJul-27-2018, 15:31:53 GMT

Machine learning (ML) is causing quite the buzz at the moment, and it's having a huge impact on healthcare. Payers, providers, and pharmaceutical companies are all seeing applicability in their spaces and are taking advantage of ML today. This is a quick overview of key topics in ML, and how it is being used in healthcare. A machine learning model is created by feeding data into a learning algorithm. The algorithm is where the magic happens.

algorithm, artificial intelligence, machine learning, (13 more...)

#artificialintelligence

Country: North America > United States (0.05)

Industry:

Health & Medicine > Nuclear Medicine (0.79)
Health & Medicine > Pharmaceuticals & Biotechnology (0.78)
Health & Medicine > Diagnostic Medicine > Imaging (0.57)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback